





e.& 
OGTIGHGL SELORITY GENCY 
POG? GEGGEE G. HEGDE, HDARVLGHO 


GACO VELL 


> SEPTENEER 1b 


Declassified and Approved for Release by NSA on 








EO 3.3b(3) E 
6/50 USC 3605 





DEREK K. CRAIG 


e. . Sep 74 * CRYPTOLOG * Page 1 


EO 3.3b(3) FOPR-SEGREF-UMBRA- EO 3.3b(6) 


PL 8 6/50 USC 3605 PL 86-36/50 USC 3605 


EO 3.3b(3 





EO 3.3b(3) 
EO 3.3b(6) 


PL 86-36/50 USC 3605 





Sep 74 * CRYPTOLOG * Page 2 


FOP-SECRE-UMBRA- 


EO 3.3b(3) 
EO 3.3b(6) 


PL 86-36/50 USC 3605 








Sep 74 * CRYPTOLOG * Page 3 


FOP-SEEREFUMBRA— 


EO 3.3b(3) 


EO 3.3b(6) 
PL 86-36/50 USC 3605 





akan een 


Sep 74 * CRYPTOLOG * Page 4 


EO 3.3b(3) 
PL 86-36/50 USC 3605 








EO 3.3b(3) 
PL 86-36/50 USC 3605 


RYPTANALYSIS 
ODE RECOVERY 


by Marsorie Mountioy 


The role of the cryptanalyst in code reconstruotion overlaps that 
of the cryptolinguiet, and both are often cdvered by the ambigueus term 
"bookbreaker". If we asoept the refinement; ;proposed by Stuart Buck, 
recognized Agency authority on code reoonstyuotion, the oryptank lyst 
"breake into" a code, while the oryptolinguist goes on from thepe to 
"“paconatruct" tha codebook. < 





We have some notes prepared several ago by Marjorie Mountjoy, 
Pi cryptanalyst now retired, about what the oryptanalyst in such a case 
should know, have, or do to ensure that the whole process of cdde recon- 
struction oan butld on a reliable body of pertinent material. Here, with 
minor alterations, are the Mountjoy notes. They demonstrate onpe again 
that age ta not a true measure of validity or valiies * 


eryptanalyst and the code reconstructo: 
should work closely together. They may even - 
be the same person, but, in any case, they 
should aim at compatibility, wherever possible, 
The compatibility ig especially desirable when @. Buck adde: 
it comes to logging messages and requesting 


machine runs. - "Code reconstructore should always have 
vasy access to these materiale, especially where 
A tidy organization will desire the fol- "successor codes are concerned. The store of 
lowing tools (I say "desire," for they probably [materials and information te important, not for 
will not be permitted to store so large a the sake of hoarding, but to reduce the time 
collection): >| required to exploit a new system. In thie busi- 


*|neas, nothing 18 so wasteful as constant re- 
* | discovery of the wheel.” 





eee 


The following definitions and formulae are 
of utmost importance: 


True base: The values assigned are known to 
be the same as those used by the original 


cryptographers. 


Relative base: The values assigned have the 
Same delta relationship as those of the orig- 
inal groups; i.e., they are off by a constant 
of 1111, 2222, etc. 


Arbitrary base: The values assigned can be 
converted to the true by the addition of a 
constant amount which is purely arbitrary. 
(The Basic Cryptologic Glossary does not make 
this distinction between relative and arbi- 
trary base. It should.) 
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a) 
Repeat rate of a code: > £;(£;-1) 
isl 


N (N-1) 





an 
Delta I.c.: &= c+ > £4 (£4-1) 
. isi 





N (N-1) 


The following relationships will be ex- 
tremely useful to the analyst working! 














rec. ed d revxamined. The monome-dinome 
hypothesis was ckly established, but nét 
before the crygtolinguist had some agoniz{ng 


moments. . a 






















, the cryptanalyst must have the 
courage of his convictions and be willing to 
defend hi3 solution of transposition bust 


which yields no repetitions at all. 


I don't want to get into the details ‘of 
additive solution, but the wise cryptanafyst 
will use statistical information to guide him 
to the shortest road to recovery. The repeat 


7 : 
rate of the code (discernible from the depths) = 


Before turning the material over to the 
eryptolinguist for reconstruction of the code- 
book, the cryptanalyst is responsible for 
making sure, if possible: 


1, that the code is not already recovered; 


2. that the material is, in fact, plain code; 
should tell him whether the solution can‘be 


obtained in a few days by hand methods of 
whether it may take a year or more and a full- 
scale mechanized attack to complete his part 
ofthe code solution. He should be able-to 

3 a one-part code from the repeat rate of 
the beginning digraphs. : 


3. that the encipherment has been completely 
and correctly removed; 


4. that the material is homogeneous; 
5, that the code is reduced to true base (not 
always possible). 


The five points may seem obvious, but 1 
can cite some heartbreaking examples, as well 


as some heartening ones: 











eryptanalyst and the crypto- 
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Virginia Valaki of G5¢ commente:* wit is 
ible to have a "tidy organtakttoy! wt 
collestion Marjorie desartped. 
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O fives, @ separa 
rary ie designed a a research 
it te the reppgettory for ob- 


place for general code recon- 
Code reconstructor and cryptanalyst may struction, machine support for bookbreaking and 
not see eye to eye on machine work, particularly] the state of the art; it te a wellepring of 
on inverse frequency lists. The former often original atudtes and innovative procedures.... 
requests that the low-frequency groups be drop- | And it ie possible to have comprehensive ar~ 
ped, and the total gets lost. That is an un- |chives without burating at the seams, thanks to 
fortunate loss for the cryptanalyst whose best {microfilm and microfiche.” 
estimate of the probability of occurrence of a 
high-frequency group is rE » The cryptanalyst coneneryivece)" 
needs such information to prepare log weights 
and to assemble lists of groups with a specifi 
probability. He also needs it to compute the 
repeat rate. (Note: This problem is solved by 
the current practice of showing the relative 
frequency as well as the absolute frequency of 
a code group. Ed.) 


Both the code reconstructor and the crypt- 
analyst rely on statistical methods, though the 
former sees them as only a small part of his 
store of tools. He is locking for a one-to-one 
correspondence between code groups and words or 
phrases, and he is guided by the frequencies of 
these items. (The measure of the extent of 
that guidance varies according to the individu- 
al's role and point of view--cryptanalyst or 
eryptolinguist.)¢ 


The cryptanalyst is in more desperate need 
of statistical guidance. He is often in the 
position of a man looking for a needle in a 
haystack when the needle looks exactly like a 
Piece of hay. 


eee 


24, Buck's comments illustrate the difference 
in aesesament of the role of statistics in code 
reconetruction. He says: "Code reconstruction 
te primarily a lingutatte process, but it in- 
volves a lot more than mere language knowledge. 
The code reqonstructor te, in fact, an analyst 
in the broadest sense of the word. Besides 
language fluency, he must have an intimate 
ledge of the country under study, ite history, 
culture, governmental structure, key personal- 
ities, geography--and a host of other mattera. 
Codebooks are reconstructed through knowledge, 
not through ignorance. The aim of code recon~ 
struction is to eatablish what is provabie-- 
not what ie plaustble." 
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